Post-transcriptional regulation of noise in protein distributions during gene expression 
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The intrinsic stochasticity of gene expression can lead to large variability of protein levels across 
a population of cells. Variability (or noise) in protein distributions can be modulated by cellular 
mechanisms of gene regulation; in particular, there is considerable interest in understanding the role 
of post-transcriptional regulation. To address this issue, we propose and analyze a stochastic model 
for post-transcriptional regulation of gene expression. The analytical solution of the model provides 
insight into the effects of different mechanisms of post-transcriptional regulation on the noise in 
protein distributions. The results obtained also demonstrate how different sources of intrinsic noise 
in gene expression can be discriminated based on observations of regulated protein distributions. 



The intrinsic stochasticity of biochemical reactions in- 
volved in gene expression can lead to large variability of 
protein levels across a clonal population of cells 1] . The 
need to regulate this variability (or noise) in protein dis- 
tributions places important constraints on cellular path- 
ways; in particular those that bring about global changes 
in gene expression. In such pathways, recent research has 
increasingly highlighted the role of post-transcrvptional 
control by proteins or regulatory small RNAs[2i]. An 
important question that arises is: what is the role of 
post-transcriptional regulation in controlling the noise in 
protein distributions? This work addresses the preced- 
ing question in the context of stochastic models of gene 
expression as discussed below. 

To elucidate the source of intrinsic noise in protein dis- 
tributions, two coarse-grained stochastic models (Fig.l 
A,B) have been proposed 0]. In one case, mRNA syn- 
thesis is modeled as a Poisson process (Poisson scenario) 
and high variability in protein levels is related to low 
abundance and infrequent synthesis of mRNAs [4|. In 
the other case (Telegraph scenario), the promoter is as- 
sumed to switch between active and inactive states, with 
mRNA synthesis occuring in bursts only when the pro- 
moter is active [B-Sl ■ The Poisson and Telegraph scenario 
models both explain experimental observations of noise 
in protein distributions |9l-lll| in terms of 'bursts' of pro- 
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tein expression; however they make distinct predictions 
for the underlying mRNA burst distribution (Fig.l). For 
the Poisson scenario model, each burst of protein expres- 
sion arises from translation from a single mRNA; whereas 
for the Telegraph scenario model, multiple mRNAs can 
contribute to a single burst of protein expression. How- 
ever, observations of protein burst distributions alone do 
not uniquely identify the underlying mRNA burst dis- 
tribution and thus cannot be used to infer the source 
of intrinsic noise in gene expression [l2!|. A currently 
open question is whether this observation remains valid 
when protein burst distributions are modulated by post- 
transcriptional regulation. 

In this paper, we develop a framework for modeling 
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FIG. 1. (A) Schematic representation of the Poisson scenario 
for mRNA synthesis. Observed bursts correspond to proteins 
produced from a single mRNA. (B) The Telegraph scenario 
assumes that DNA transitions between an 'off' and 'on' state. 
mRNA synthesis only occurs in the 'on' state and mutiple 
mRNAs can be produced per burst. (C) The Poisson scenario 
reaction scheme for protein synthesis. 



post-transcriptional regulation of genes expressed accord- 
ing to the Poisson and Telegraph scenarios. The analyt- 
ical results obtained provide insight into how different 
mechanisms of post-transcriptional regulation modulate 
noise in protein distributions. The results obtained also 
show how different sources of intrinsic noise in gene ex- 
pression can be discriminated based on observations of 
regulated protein burst distributions. 

We first analyze how post-transcriptional regulation 
modifies protein bursts under the Poisson scenario. Sev- 
eral recent studies have focused on the corresponding 
mean-field and stochastic models, in particular for reg- 
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ulation by small RNAs 



These studies have pri- 



marily focused on small RNAs which act via irreversible 
stoichiometric degradation of mRNAs. On the other 
hand, binding of the post-transcriptional regulator can 
more generally be considered as a reversible reaction. For 
this case, and in the limit of large regulator concentra- 
tions (see below) , we wish to analyze the effects of differ- 
ent mechanisms of post-transcriptional regulation on the 
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noise in protein burst distributions. 

The proposed reaction scheme for our model is shown 
in Fig. (2 A) for a given concentration of the post- 
transcriptional regulator: the regulator binds mRNA to 
form a complex with rate a; the dissociation rate for 
the complex is (3. The parameters kpi and kp2 are the 
rates of protein production from the mRNA in free and 
bound states and and are the corresponding decay 
rates. For global post-transcriptional regulators that are 
present in large numbers, it is a good approximation that 
binding to the target mRNA does not significantly alter 
the concentration of the regulator. In this case, fluctua- 
tions in regulator concentration can be neglected and the 
rate a can be taken to be constant. 

For the Poisson scenario, the protein burst distribu- 
tion, Pb{n), corresponds to the number of proteins trans- 
lated from a single mRNA before it decays. During this 
process, the mRNA can exist in two states: either free or 
bound in a complex with the post-transcriptional regula- 
tor. Correspondingly, we define the functions fi (n, t) and 
/2(n, t) (generalizing the approach outlined in [16|) which 
denote the probabilities of finding the mRNA in free and 
bound states respectively at time t, having produced a 
burst of n proteins. The initial condition corresponds to 
creation of the mRNA in its free state at i = 0. Now, 
the burst distribution Pb{n) can be obtained from /i(ri, t) 
and /2(?^, t) as 



fi{n,t)^imdt+ / f2{n,t)^icdt (1) 
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Furthermore, the time evolution of fi(n,t) and f2{n,t) 
is determined by the following Master equations: 



dh{n,t) 
dt 

dt 



= kpiifiin - - fi{n,t)) 
- {^jn + a)fi{n,t) + I3f2{n,t) 

= kp2(f2{n - - f2{n,t)) 

-{iic + P)f2{n,t)+ah(n,t) (2) 



The above equations can be analyzed further using a 
combination of generating functions and Laplace trans- 
forms. Specifically, defining Gb{z) = J2n^"'^bin) and 
Fiai^, s) = J2n ^" ir e"''*/i,2(", t) dt, we obtain 



Gb{z) = lim(^,„i^i(z,s) 



(3) 



We now consider the Telegraph scenario wherein mul- 
tiple mRNAs can be produced during a single burst. In 
this case, the probability of having m mRNAs in one 
burst, P{m), is given by a geometric distribution, condi- 
tional on the production of at least 1 mRNA 



P{m) = {l-v.^Y 



Pn 



(4) 



Eq.Q serves as a general formula for the mRNA burst 
distribution characterizing both Poisson and Telegraph 




Protein 



B. 



0.02 



Full Repression 

-o- Decay Modulation 
-•- Activation 



0.00 




200 



n 



400 



FIG. 2. (A) Kinetic scheme for regulation of protein produc- 
tion by a mRNA-binding regulator. (B)The protein burst 
distribution Ph{n) for full repression, decay modulation and 
activation from Eq.lO. In all three cases, the mean of the 
protein burst distribution is kept the same. The burst distri- 
bution for full repression is identical to the geometric distribu- 
tion with the same mean, whereas Pf,{n) for decay modulation 
and activation deviates significantly from the geometric dis- 
tribution. Parameters for decay modulation and activation 
are chosen such that — — 5, 



/3 



kpi 



= 1, -j^ = 5, p,„ ^ 1 and 

H'Tn 



4, pm ~ 1 respectively. 



scenarios. The case Pm — 1 correspond to a single 
mRNA produced every burst (Poisson scenario); whereas 
if Pm < 1, the mean number of mRNAs produced per 
burst rab{= 1/pm) is greater than 1 (Telegraph scenario). 
In general, each mRNA will produce a random number 
of proteins drawn from the distribution Pb{n) (the cor- 
responding generating function Gb{z) is given by Eq.(I3])) 
and furthermore the number of mRNAs in the burst is 
also a random variable defined by the distribution Eq. (|3]) . 
We denote the distribution of proteins produced from all 
the mRNAs in the burst by Pb{n). The corresponding 
generating function G'f^{z) is given by 



G'b{z) 



Gb{z)p„ 



1 - Gb{z)il ~ Pm)' 



(5) 



Evaluation of the functions Fi^2{z,s) in combination 
with Eq. ([U and Eq. ([3]) then leads to the exact expression 
for G"{,(z), which can be written as 



G'b{^)=x'—^ + il-X)'-^ 
z — bi z — b2 



(6) 



where 



_ , A - (fcpl(/3 + A^c) + kp2{a - ^ImPra)) 

X — 1= 

2VA 

^ _ fcpi(/3 + ^c) -I- kp2{a + UmPm) ± VA 

Ikpi kp2 

A = {kpi{(3 + He) - kp2(a + flmPm))'^ 

+ Aakpikp2{l3 + fic - PcPm)- 



(7) 



The above expression indicates that the distribution 
of proteins produced in a single burst (Pliji)) can be 
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expressed as a weighted sum of two geometric distribu- 
tions. While the complete expression for Pl{n) can thus 
be derived from the results obtained, in some cases, the 
primary interest is in derived quantities characterizing 
the noise in protein distributions. For example, several 
studies have focused on the noise strength (or Fano Fac- 
tor) all rib [H- For the protein burst distribution Pl,{n), 
both the mean and the noise strength can be obtained 
from the generating function as 



a 



rib 



^ 1 

rib 



fib 



{akp2 + kpiP -I- kpiHc)iafic + Pfim + McMn 



(8) 



It is noteworthy that Eq. ([7]) is valid for the most gen- 
eral choice of parameters. To gain additional insight, let 
us consider specific parameter choices of interest. For ex- 
ample, taking the limit a — corresponds to the unreg- 
ulated protein burst distribution. In this case, we obtain 



G'b{z) 



fJ-r. 



Z) 



(9) 



which corresponds to the generating function of a ge- 
ometric distribution in agreement with previous studies 
Il|,ll2,ll7|,ll8| . Of greater interest is the effect of different 
modes of regulation. While a generally accepted model 
is that regulator binding prevents ribosome accesss (i.e. 
kp2 = 0), recent studies have shown that small RNAs 
can also repress gene expression by binding in the coding 
regioii significantly downstream of the ribosome binding 
site pjf- In the latter case, regulator binding is not ex- 
pected to affect the translation rate, but instead alters 
the mRNA decay rate. To explore the effects of these 
different regulatory mechanisms on the noise in protein 
distributions, we consider two special cases for the gen- 
eral results derived above: 1) full repression {kp2 — 0) 



and 2) decay modulation (k 



'p2 



For full repression (in the limit fc^ 



■p2 



0), we have 



G'biz) = 



-(1-z) 



(10) 



The result is identical to Eq.® provided that the mRNA 



degradation rate is rescaled from /i,„ to /i„ 



Thus 



the protein burst distribution remains a geometric distri- 
bution but with a reduced mean due to lowering of the 
effective mRNA lifetime. This implies that regulation by 
full repression results in a protein burst distribution that 
is identical to that of an unregulated burst distribution 
with the same mean. 

On the other hand for regulation by decay modulation, 
the burst distribution shows deviations from a geometric 



distribution (Fig. 2B). To analyze this further, let us 
focus on the noise strength cr'^/rib in Eq.® which, for 
decay modulation, is given by 



1 



rib 



2akpi{l-9i) 



rib (a + P + eifi^){f3 + 0i{a + mri)) 

= l + nb + Q (11) 

where 6i — P^m > 1 find the term Q quantifies the 
deviation from the geometric distribution ((5 = for a 
geometric distribution). Thus for regulation by decay 
modulation, the noise strength can be tuned by the pa- 
rameter 6i resulting in a burst distribution with reduced 
variance when compared an unregulated burst distribu- 
tion with the same mean. Eq. (|11[) indicates that this re- 
duction can be significant since the maximum magnitude 
for ^ is 0.5. Such a narrowing of the variance relative 
to the mean has been previously proposed as a potential 
function for small RNAs with important implications for 
canalization of gene expression during development [20| . 

The previous results for repression mechanisms can be 
contrasted with the effect of post-transcriptional activa- 
tion of gene expression. The burst distribution for acti- 
vation also shows significant deviations from a geometric 
distribution with the same mean (Fig. 2B). For activa- 
tion due to increased protein production (with = 
the deviation Q relative to the burst mean is given by: 



Q_ _ 2a02PmfJ'mid2 - 1) 

rib {a92 + P + Mm)^ 



(12) 



where 02 = kp2/kpi. As ^2 > 1 for activation, the noise 
will be greater than that of an unregulated burst distri- 
bution with the same mean. The value of depend- 
ing on the choice of 9i and a, can be made arbitraily 
large. Our results thus indicate that activation of gene 
expression by small RNAs can potentially lead to large 
variance in protein distribution, which in turn can give 
rise to phenotypic heterogeneity that is often beneficial 
for the organism (2l| . 

Eq. pTI) also illustrates conditions under which the 
Poisson and Telegraph scenarios can be distinguished 
based on observations of protein burst distributions. The 
unregulated burst distribution is geometric and thus 
completely determined by its mean value rib = ^(^)- 
Since there is effectively one measurable quantity {rib) for 
the burst distribution, pm canot be determined given that 
is not known 12 1. Hence the Poisson and Telegraph 
scenarios cannot be distinguished in this case. However, 
for the case of decay modulation (with kpi = kp2), we 
have an additional measurable quantity: Q. It is of in- 
terest to note that Q depends on , but is independent 
of Pm- Thus, measurements of Q and rib can be used to 
determine both ^ and Pm and thereby to discriminate 
between the Poisson and Telegraph scenarios. 

The argument above provides a means of determin- 
ing Pm provided the interaction parameters such as a, 
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/? and 9i are known. In general, these parameters are 
not known, however for regulators such that the dissoci- 
ation rate — > 0, the following protocol can be used to 

determine Pm- (i) Obtain the mean protein burst levels 
without regulation, denoted by np. (ii) Choose a certain 
regulator concentration. Obtain the mean protein burst 
level Ubi and the corresponding variance. Determine 
the deviation from a geometric distribution as defined 
in Eg. dTlT) . which is denoted by Qi and let ni = nbi/riQ. 
(iii) Change the concentration of the regulator, which 
effectively changes the regulator binding rate a. Re- 
peat step (ii) and obtain the corresponding quantities 
denoted by Q2 and ri2 = "62/?^o- Given the five quanti- 
ties no, 1,2 and Qi,2, the mean transcriptional burst size 
mb(= 1/pm) is given by: 

Qi(l-n2)-Q2(l-ni) 



rub 



-2nQnin2 
(1 



Qini(l - 712) 
ni){l - n2)(ni - 



- (92^2(1 - ni) 
^2) 



Qini{l - 77,2)2 - (52^2(1 - ni)2 

Using stochastic simulations, we have verified that the 
above expression accurately predicts the degree of tran- 
scriptional bursting. It should be noted that experimen- 
tal approaches have been developed recentl y fo r direct 
measurements of mRNA burst distributions 22, 23| and 



it would be informative to compare results from these di- 
rect approaches with estimates from the above protocol. 

Finally, we note that the result derived above lead to 
corresponding analytical expressions for steady-state pro- 
tein distributions over a population of cells. Recent work 
has shown that, when protein lifetimes are much longer 
than mRNA lifetimes, an effective Master equation can 
be written down for proteins alone 17, T^. In this ap- 
proximation, given a geometric distribution for protein 
bursts, the corresponding steady-state protein distribu- 
tion is a negative binomial distribution Given the 
most general burst distribution obtained above, we de- 
rive that the corresponding steady-state distribution is 
a convolution of two negative binomial distributions. A 
detailed analysis of the corresponding expressions for the 
mean and variance will be presented elsewhere. 

In summary, we have derived analytical expressions 
which characterize the noise in protein distributions for 
a stochastic model of post-transcriptional regulation. It 
is noteworthy that the expressions provide functional 
forms for the entire probability distribution (and not just 
the mean and variance) for arbitrary parameter choices. 
This knowledge can be a useful input for approaches to 
infer cellular mechanisms and parameters based on entire 
distributions 24, [2^. The results also provide insight 



into how different mechanisms of post-transcriptional 
regulation can be used to fine-tune the noise in stochastic 
gene expression with potential implications for studies 
addressing on the evolutionary importance of noise in 
biological systems \2d\. In some limits, the modulated 
burst distributions can be used to infer the degree of 



transcriptional bursting and hence to determine the 
source of intrinsic noise in gene expression. The results 
derived can serve as building blocks for future studies 
focusing on regulation of stochastic gene expression. 
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